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Abstract 

We study the avoidability of long fc-abelian-squares and fc-abelian- 
cubes on binary and ternary alphabets. For k = 1, these are Makela’s 
questions. We show that one cannot avoid abelian-cubes of abelian period 
at least 2 in infinite binary words, and therefore answering negatively one 
question from Makela. Then we show that one can avoid 3-abelian-squares 
of period at least 3 in infinite binary words and 2-abelian-squares of period 
at least 2 in infinite ternary words. Finally we study the minimum number 
of distinct fc-abelian-squares that must appear in an infinite binary word. 


1 Introduction 

Avoidability of structures and patterns has been extensively studied in theoret¬ 
ical computer science since the work of Thue on avoidability of repetitions in 
words m ■ Thue showed that there are infinitely long ternary words avoiding 
squares (factors of the form ww where w is a word) and infinitely long binary 
words avoiding cubes (factors of the form www where w is a word). 

The avoidability of abelian repetitions has been studied since a question from 
Erdos in 1957 j3[6j. A factor uv is an abelian-square if u is a permutation of 
the letters of v. Erdos asked whether it is possible to avoid abelian-squares in 
an infinite word over an alphabet of size 4. (Abelian-square-free ternary words 
have a length of at most 7.) After some intermediary results (alphabet of size 
25 by Evdokimov [7j and size 5 by Pleasant m ), Keranen answered positively 
Erdos’s question by giving a 85-uniform morphism (found with the assistance 
of a computer) whose fixed-point is abelian-square-free |10| . Moreover, Dekking 
showed that it is possible to avoid abelian-cubes on a ternary alphabet and 
abelian-4th-powers over a binary alphabet [3]. 

Erdos also asked if it is possible to avoid arbitrarily long ordinary squares 
on binary words. This question was answered positively by Entringer, Jackson 
and Schatz [4] . Makela asked the following two questions about the avoidability 
of long abelian-cubes (resp. squares) on a binary (resp. ternary) alphabet: 

Question 1 (Makela (see jl 1] )) ■ Can you avoid abelian-cubes of the form uvw 
where |w| > 2, over two letters ? - You can do this at least for words of length 
250 . 
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Question 2 (Makela (see I IT])). Can you avoid abelian-squares of the form uv 
where |u| > 2 over three letters ? - Computer experiments show that you can 
avoid these patterns at least in words of length 450. 

The notion of fc-abelian repetition has been introduced recently by Karhumaki 
et al. as a generalization of both repetition and abelian repetition [9] . One can 
avoid 3-abelian-squares (resp. 2-abelian-cubes) on ternary (resp. binary) words 
M- Following Erdos’s and Makela’s questions, one can ask whether it is pos¬ 
sible to avoid long fc-abelian-powers on binary (resp. ternary) words. 

In Section [3l we answer negatively Makela’s Question [L] and we propose 
a new version of the conjecture. In Section U we show that one can avoid 
3-abelian-squares of period at least 3 in binary words and 2-abelian-squares 
of period at least 2 in ternary words. In Section [5l we study the minimum 
number of distinct fc-abelian-squares that must appear in an infinite binary 
word. Finally, in Section [SJ we explain the computer searches we use to find the 
morphisms of Section [4] and Section [5] 

2 Preliminaries and definitions 

We use terminology and notations of Lothaire [TJj. Let E be a finite alphabet. 
For a word u £ E* and a £ E, we denote by |it| a = |{* : u[i\ = a}| the number 
of occurrences of the letter a in u. For u,w £ E*, we denote by |u|u, = |{* : u[i : 
i + |tc| — 1] = w}| the number of occurrences of the factor w in u. 

Two words u and v are said to be abelian equivalent , denoted u « a v, if for 
every a £ E, \u\ a = |u| Q , and they are said k-abelian equivalent (for k > 1), 
denoted u ~ a ,k v, if for every w £ E* such that |w| < fc, |u| w = |u| w . A 
word ii\U 2 ■ ■ -u n is a k-abelian-n-power if it is non-empty, and U\ ~ atk U 2 ~ a .fc 
... ~ a ,k u n . Its period is |iti|. Similarly, a k-abelian-square (resp. k-abelian- 
cube) is a fc-abelian-2-power (resp. fc-abelian-3-power). A word is said to be 
k-abelian-n-power-free if none of its factors is a fc-abelian-n-power. Note that 
when k = 1, the fc-abelian-equivalence is exactly the abelian equivalence, and 
we ornmit the “1-” prefix in this case. 

The Parikh vector of a word w £ E*, denoted ^(w), is the vector indexed 
by E such that for every a £ E, ^(w;)[a] = |w| a - Then, by definition, two words 
u and v are abelian-equivalent if ’F(u) = 'F(u). For a set S C E* and a word 
w £ E*, we denote by 'Fs(ui) the vector indexed by S such that for every s £ S, 
’Fs(u;)[s] = |w;| s . We may write 'F/ C (’u;) instead of '&- S k ( w ) if E is clear in the 
context. 

For all u £ E*, i < |u|, let prefj(u) be the prefix of size i of u and sufj(u) be 
the suffix of size i of u. There are equivalent definitions of fc-abelian equivalence 
(see [9])- Two words of size at most 2fc — 1 are fc-abelian equivalent if and only if 
they are equal. For every two words u and v of size at least fc — 1, the following 
conditions are equivalent: 

• u and v are fc-abelian equivalent (i.e. u ~ a ,k v), 

• ty k (u) = V k (v) and pref^fiu) = pref^fiw), 
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• 4 < k {u) = ^ k {v) and suf fc _i(u) = suf fc _i(u). 


3 Abelian cubes and Makela’s Question [1] 

Dekking showed that it is possible to avoid abelian-cubes in an infinite word 
over a ternary alphabet [3]. More recently Rao showed that one can avoid 2- 
abelian-cubes over a binary alphabet m and one can check that every word 
over a binary alphabet of length greater than 9 contains an abelian-cube. The 
only open question about the avoidability of long /c-abelian-cubes on infinite 
words is then the avoidability of long abelian-cubes over the binary alphabet. 
This is the subject of Question [T] from Makela: he asked whether one can avoid 
every abelian-cubes of period at least 2 in binary words. We answer negatively 
this question. 

For this, we used a property of Lyndon words that made the exhaustive 
search much faster. A word w £ E* is a Lyndon word if for all u, v £ £ + such 
that w = uv, w <ie X vu, where <i ex is the lexicographic order. The well known 
Chen-Fox-Lyndon Theorem states that every word may be written uniquely 
as a concatenation of non-increasing Lyndon words (see for example [121 1. In 
the following, we refer to this decomposition as the Lyndon factorization. A 
language L C £* is factorial if for every w in L 1 every factor of w is in L. 

Lemma 1. Any factorial language L with arbitrarily long words contains arbi¬ 
trarily long Lyndon words or repetitions of arbitrarily large power. 

Proof. Let us assume that there are no arbitrarily long Lyndon words in L. 
This implies that there is a finite number n of Lyndon words in L and s £ N 
such that for every Lyndon word w in L, |w;| < s. Let w ±,..., w n £ L n be the 
Lyndon words of L ordered by decreasing lexicographic order. 

Then using the Lyndon factorization for every w £ L there are some Lyndon 
words L\ >i ex L 2 >iei ■ ■ ■ >iex Ld such that w = L\ ... Ld- The fact that our 
language is factorial tells us that all the Li are in L. We get that for every w £ L 
there are a\,... ,a n £ N such that w = wf 1 ... . Then |w| < ££ |wj| xa; < 

s x 'Yf i cti < s x n x nraxi(ai). 

Since L contains arbitrarily long words, then for any t £ N there is a w £ L 
such that |w| > t x s x n. Let j £ {1,...,n} be such that ay = max;(ai). Thus 
aj > t and then ( Wj )* £ L. Thus we have arbitrarily long powers in L. □ 

A set of words that avoid certain kind of abelian repetitions is a factorial 
language and does not contain arbitrarily large powers. Thus we just need to 
check that there are no arbitrarily long Lyndon words in it to deduce that this 
set does not contain arbitrarily long words. The exhaustive search on prefixes 
of Lyndon words is then much shorter. Figure |T] shows how it helps for the 
exhaustive search of binary words avoiding abelian-squares of period at least 2. 
The next Proposition answers negatively Question [TJ 

Proposition 1. There is no infinite word over a binary alphabet avoiding 
abelian-cubes of period at least 2. 
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000-0001^ 

/ 00010-000100-0001000 

10 

\ ^0011 —00111 001110 0011100 —00111000 001110001 —0011100011 

001^ 

"ooio — 00100-001000 

100 - 1000 — 10001 - 100011 - 1000111 — 10001110 - 100011100 

/ 


101-1011 —10111 

11100 - 111000 - 1110001 — 11100011 - 111000111 

111 - 1110 ^ 

/ 11101-111011-1110111 

1 

^1100 — 11000 110001 1100011 — 11000111 110001110 — 1100011100 

no'’" 

^1101 — 11011 - 110111 


011-0111 — OHIO 


\ 

010 


ooo-oooi: 


,00011 — 000111 — 0001110 — 00011100 - 000111000 

~ 00010 — 000100 — 0001000 


£ ^0011 — 00111 — 001110 — 0011100 

001^ 

\ "ooio — 00100 — 001000 


1 -11 - 111 


Figure 1: Top: the exhaustive search of binary words avoiding abelian-squares 
of period at least 2. Bottom: the same exhaustive search restricted on the 
prefixes of Lyndon words. 
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We checked using a computer program that there are only finitely many 
Lyndon words over a binary alphabet avoiding abelian-cubes of period at least 2. 
The program took approximately 3 hours to find all 2 732 711352 such Lyndon 
words and prefixes of Lyndon words. The longest word has a length of 290. 
Using Lemma|T|we deduce that there is no infinite binary word avoiding abelian- 
cubes of size at least two. Then we can reformulate the question and ask: 

Question 3. Is there a p G N such that one can avoid abelian-cubes of period 
at least p over two letters ? 

For p = 3, we found a word of lenght 2 500. 


4 Avoiding long /c-abelian-squares on binary words 

It is easy to verify that one cannot avoid squares of period at least 2 over a 
binary alphabet. Entringer et al. showed that it is possible to construct a 
binary word avoiding squares of period at least 3 (Tj. They also showed that 
every infinite binary word contains arbitrarily long abelian-squares. Thus, one 
can wonder about the avoidability of large fc-abelian-squares. Rao asked the 
following question: 

Question 4 (Rao |14|~). What is the smallest k (if any) such that arbitrarily 
long k-abelian-squares can be avoided over a binary alphabet ? 

Since arbitrarily long abelian squares cannot be avoided in binary words, k 
is at least 2. In this section, we show that k is at most 3, that is one can avoid 
long 3-abelian-squares over a binary alphabet, by giving a morphism whose fixed 
point avoids 3-abelian-squares of period at least 3. 

A morphism h is said (p,k)~ abelian-square-free if for every abelian-square- 
free word w. h(w) avoids fc-abelian-squares of period at least p. Let h be the 
following morphism: 

! 0 -> 00001101010 
1 -4 00011111010 
2 -4 00110100110 
34 00111001010 . 

Theorem 1. The morphism h is (3,3)-abelian-square-free. 

Proof. The proof is based on the same idea as the one used by Rao to give 
sufficient conditions for a morphism to be fc-abelian-free m which is a gener¬ 
alization of the sufficient conditions given by Carpi for abelian-free-morphisms 
[1]. We checked the sufficient conditions on h by computer. 

In the proof we use the following property: for every fc, i G N and u, v G E* 
such that i < fc, k— 1— i < |u| and i < |u|: 

'Lfc(itu) = pref i (u))+4'fe(suffc_i_ ! ;(u)u). (S) 

Let A = {0,1, 2, 3} and w G A*. Let us show that if h(w) contains a 3- 
abelian-square of period at least 3, then w is not abelian-square-free. Note that 
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every image h[x), x G A, starts with the prefix p = 00 and ends with the suffix 
s = 10. 

We check using a computer that for every a,b G A, a ^ b, every 3-abelian- 
square of h(ab ) has period at most 2. So if there is a forbidden 3-abelian-square, 
it has to be on the image of at least 3 letters. Then there are ai,a 2 ,a 3 G A, 
xi,x 2 G A* and («i, «i), {u 2 ,v 2 ), {u 3 ,v 3 ) G ({0,1}*, {0,1} + ) such that: 

• ( 11 X 1 ( 12 X 2(13 is a factor of w, 

• for every i G {1, 2, 3}, UiVi = h{at), 

• V\h{xi)u 2 ~a ,3 v 2 h(x 2 )u 3 . 

Since \viu 2 v 2 u 3 \ > l+|/i(a2)| > 12, either \v\u 2 \ > 6 or \v 2 u 3 \ > 6 . More¬ 
over, \v\h{x\)u 2 \ = \v 2 h(x 2 )u 3 \, thus for all i G {1,2}, \vih(xi)iii+i\ > 6, and 
for all i G {1,2}, \vi\ > 2, \h(xi)\ > 2 or > 2. If |rq+i| > 2, then: 

'S' 3 (v i h(xi)u i+ 1) 

= ^’ 3 (vi 00 )+'b 3 {h(xi)u i+1 ) (using (0 and pref 2 (/i(x l )u l+ i) = 00) 

= T 3 (u l 00)+^ 3 (/i(:r i )00)-|-’I'3(ui + i) (using 0 and pref 2 (u i+ i) = 00) 

= T 3 (u l 00)+«' 3 (/i(a:i)00)+ 4, 3(10n l +i)-^ 3 (1000). 

If |uj| > 2 or \h(xi)\ > 2, we have the same result. So for every i G {1,2}, we 
get: 


'S>3(v i h(xi)ui + 1 ) = tk 3 (i>i00)+\& 3 (/i(x,)00)+\I/ 3 (10u i+ i)-’]/ 3 (1000). (L) 

Let N be the matrix indexed by {0, l} 3 x{0,1, 2, 3} with N[w, x\ = |/i(a:)00| u; . 

/3 1 2 1 1 2 1 0\ 
t 2 1111113 
12 12 2 12 0 
\1 2212111 / 

For every word w, 'L 3 (h(w)00) = N^f(w), thus the equality (0 can be 
rewritten: 


'b 3 (v i h(x i )u i+ i) = T 3 (w i 00)+iVd'(a;j)+'I' 3 (10it i+ i)-T 3 (1000). 

Using v\h(xi)u 2 « a ,3 V 2 h(x 2 )u 3 , we get the following: 

N(^(x 2 )-^(x 1 )) = ^ 3 (v 1 00)+^ 3 (10u2)-^ 3 {v 2 00)-^ 3 (10u 3 ). ( 1 ) 

Let M be the sub-matrix of N made of its rows 1,2,3 and 4 (they correspond 
to the words 000,001, 010,011). 

/3 2 1 1\ 

M= 1 1 2 2 

2 112 

\1 1 2 l/ 
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Let S = {000,001,010,011}. Then 't's('U)) is the sub-vector of ^ 3 (w) made of 
the rows 1, 2, 3 and 4. We can check that M is non-singular. Thus: 

4'0 2 )-’I'(zi) = M- 1 (4's(ni00)+4's(10M2)-4's(^200)-4's(10u 3 )). (2) 

Let: 

^ s{vi,u 2 ,v 2 ,u 3 ) ='L5(ni00)+4's(10rt2)—'Ls(r' 200 )—4'5(10rt3), and 

^3{vi,U2,v 2 ,u 3 ) = 'k 3 (ni00)+4 , 3 (10it2)— v f , 3 (t! 2 00)—4 , 3(10it 3 ). 

From equation {T}, u 2 , v 2 , u 3 ) is in Im(7V), and from equation {2]), 

s(vi, u 2 , v 2 , M 3 )) is an integer vector. From v\h{xi)u 2 « a ,3 v 2 h(x 2 )u 3 , 
if we note p = 00, s = 10 and k = 3, we get pref fc _ 1 (t>ip) = pref fc _ 1 (n 2 p), 
suf fc _i(sw 2 ) = suf fc -i(sw3)- 

The following claim is verified using a computer program. There are 4 3 
values for the cq and ll 3 ways of choosing the Ui,Vi for each of them which 
makes 85 184 cases to check (most of them are eliminated by the prefix and 
suffix conditions). 

Claim 1. For all ai, a 2 , a 3 G A and (ui, zq), ( u 2 , v 2 ), ( 03 , v 3 ) G ({0,1}*, {0,1} + ) 
such that: 

• Vi G {1,2,3}, UiVi = h(ai), 

• pref^^vip) = pref k _ 1 (v 2 p) and suf fc _i (su 2 ) = suf fc _i (su 3 ), 

• tyk(vi,U 2 ,v 2 ,u 3 ) G Im(.ZV’) and s(vi,u 2 ,v 2 ,u 3 )) is an integer vec¬ 

tor, 

there are (a±, a 2 , ct 3 ) G {0,1} such that: 


M 1 ('5’ s { vi , u 2 , v 2 , u 3 )) = ai4>(ai)-(2a 2 -l)4 , (a 2 )-(l-a'3)4i(a3). (E) 

From the claim we have (on, 02 , 0 : 3 ) G {0,1} such that equation (JE]) is ful¬ 
filled. Now we can introduce xi,x' 2 such that: Vi G {1,2}, x\ = a“ i Xia i ,“ >+1 . 
Then x^x^ is a factor of w and we have the following. 

’F(2/ 2 )-'F(a/ 1 ) = ’L(x 2 )-)-(l—03) 4/(03 )+a2'L(a 2 ) —(4 1 (xi)-|-(l—o 2 ) ^(02 )+o:i'L(ai)) 
= M~ 1 (^s(vi,u 2 ,V2,u 3 ))-a 1 ^(ai)+(2a 2 -l)'it(a2)+{l-a 3 )^{a 3 ) 
4'(4)-4'(x' 1 ) =0 

This implies that there is an abelian-square on w. □ 

Using Theorem Q] together with the existence of abelian-square-free words 
over four letters, we get the following corollary. 

Corollary 1. There is an infinite binary word that avoids 3-abelian-squares of 
period at least 3. 
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Moreover, we can deduce the exponential growth of such words from the 
exponential growth of abelian-square-free words over four letters 0. Corollary [T] 
gives a partial answer to Question 0J there is such a k, and k £ {2,3}. We can 
then ask the following question. 

Question 5. Can we avoid 2-abelian-squares of period at least p on the binary 
alphabet, for some p £ N ? 

Computer experiments show that we can avoid those patterns for p = 3 in 
a word of length 15 000. 

2-abelian squares over a ternary alphabet 

Rao showed that one can build an infinite word that avoids 3-abelian-squares 
over a ternary alphabet |14j . The longest 2-abelian-square-free ternary word 
has a length of 537 [5]. Makela asked whether we can avoid abelian-squares of 
period at least 2 in ternary words (Question [2}. We give the answer to a weaker 
version of this question, that is one can avoid 2-abelian-squares of period at 
least 2 over the ternary alphabet. Let: 



0 -> 00021 
1 -> 00111 
2 -> 01121 
3 -> 01221. 


Theorem 2. /12 is (2, 2) -abelian-square-free. 

Proof. The proof is also done by checking sufficient conditions, similar to those 



If w is an infinite square-free-abelian word over four letters, /i 2 (ic) is a ternary 
word which avoid 2-abelian-squares of period at least 2. 

5 Minimal number of distinct 3-abelian-squares 
in infinite binary words 

Fraenkel and Simpson showed that there is an infinite binary word containing 
only the squares 0 2 , l 2 , (01) 2 |Sj. Moreover, every binary infinite word contains 
at least three distinct squares. It is natural to ask whether this property can be 
extended to the fc-abelian case: is there a k £ N such that there is an infinite 
binary word that contains only 3 distinct fc-abelian-squares? 

More generally let g(k) be the minimal number of distinct fc-abelian squares 
that an infinite binary word must contain. Any (h + l)-abelian-square is a k- 
abelian-square so g is non-increasing. From Fraenkel and Simpson’s result, we 
know that g{k) > 3 for all k. 
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' 0 -»• tjlOOlOllOOOlOlllOOOllOOlOllOOOlOlllOOlOllOOOlllOOlOlllOOOllOOlOHOOO 
10111001011001110001011000111001011100011001011000101110010110001110 
01011100011001011000111001011001110001100101100011100101110001100101 
10001011100101100011100101110001100101100011100101100111000101100011 
10010110001011100101100111000101110010110001011100011001011000111001 
011001110001011100101100011100101110001 It! 

1 -*• UOOOIIIOOIOIIOOIIIOOOIIOOIOIIOOOIIIOOIOIIOOIIIOOOIOIIIOOIOIIOOOIHOO 
10111000110010110001110010110011100011001011000111001011100010110001 
11001011001110001100101100011100101100111000101100011100101100010111 
00011001011000111001011001110001100101100011100101110001100101100010 
11100101100011100101110001100101100011100101100111000110010110001110 
0101110001100101100010111001011001110 ?; 

3 : I 2 —► UOOOIIIOOIOIIOOIIIOOOIIOOIOIIOOOIIIOOIOIIOOIIIOOOIOIIIOOIOIIOOOIHOO 
10111000110010110001011100101100111000101100011100101100010111001011 
00111000101110010110001110010111000110010110001011100101100111000101 
11001011000101110001100101100010111001011001110001011000111001011001 
11000110010110001110010111000110010110001011100101100111000101110010 
110001011100011001011000101110010110011100011 ^ 

3 ->• itOOOlllOOlOllOOOlOlllOOOllOOlOllOOOlOlllOOlOllOOOlllOOlOlllOOOHOOlO 
11000111001011001110001100101100011100101110001011000111001011001110 
00110010110001110010110011100010110001110010110001011100011001011000 
10111001011001110001011000111001011100010110011100011001011000111001 
01100111000101100011100101100010111001011001110001011100101100011100 
101110001100101100010111001011001110 ?; 

Where: 


U = 11000110010110001011100101100111000101100011100101100 

01011100101100111000101110010110001011100011001011000 
111001011001110001100101100010111001011001110001011 
v = 00101100011100101100111000110010111000101100111000101 

11001011000101110001100101110001011001110001100101100 
01110010111000101100011100101100111000101100011100101 


Table 1 : A ( 5 , 3 )-abelian-square-free morphism, with only three distinct 5 - 
abelian-squares: 00, 11 and 0101. 
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Proposition 2. The morphism /13 (defined in Table\Jf) is ( 5 , 3 ) -abelian-square- 
free. Moreover, for every abelian-square-free word w, hs(w) contains only 3 
distinct 5-abelian-squares: 0 2 , l 2 and (01) 2 . 

Proof. The proof that /13 is ( 5 , 3 )-abelian-square-free is also done by a computer 
check, similar to the proof of the Theorem [ 1 ] This morphism is not uniform, so 
we need to check images of words of size up to 3 to ensure that the image of a 
long 5 -abelian square starts and ends on images of different letters. □ 


This proposition together with Keranen’s word tells us that for any k > 
5 there is an infinite binary word with only 3 distinct fc-abelian-squares, i.e. 
g{k) = 3 for every k > 5 . Propositions [ 3 ] and 0] give us that g( 3 ) = g( 4 ) = 4 . 

Proposition 3. Every word of size more than 87 over the binary alphabet con¬ 
tains at least 4 distinct 4 ~abelian-squares. 

This was verified by an exhaustive computer search. 


Proposition 4. Let: 


/14 : 


0 0001100101001101011000101010001011101011000101 

1 -t 0001100101001101011001110101011100011101011000101 

2 0001100101001110001010001100101100011101011000101 

3 000110010100111001010100111000101100101011000101 . 


Then /14 is ( 3 , 3 )- abelian-square-free. Moreover, for every abelian-square-free 
word w, h±(w) contains only 4 distinct 3-abelian-squares: 0 2 , l 2 , (01) 2 and 
( 10 ) 2 . 

The proof that h .4 is ( 3 , 3 )-abelian-square-free is also done by a computer 
check, similar to the proof of the Theorem [l] One can then check that ( 00 ) 2 
and (11) 2 do not appear as factors of any image of a two-letter word. 

Finally using again an exhaustive search we were able to give the lower bound 
3 ( 2 ) > 5 . 

Proposition 5. Every word of size more than 92 over the binary alphabet con¬ 
tains at least 5 distinct 2-abelian-squares. 

The following question is a stronger version of Question [ 5 j 

Question 6. How many distinct 2-abelian-squares must an infinite binary word 
contain? 


6 Computer searches 

The toughest part to prove the existence of an infinite word with a desired 
property (say, the property P) is to find a morphism with sufficient conditions 
whose fixed point has the property P. Let W be the family of finite words 
with the property P. This part was done by a computer-assisted search, as 
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follows. We look for a morphism whose images share a long common prefix and 
a long common suffix, to avoid the creation of small forbidden patterns when 
we concatenate two images. For this, we first selected one “good” factor w = uv 
(with |u| = 1171 ), where v will be the common prefix, and u the common suffix. 
To optimize the chance of success, we took a factor w which is supposed to 
appear often in words in W: we constructed, with a backtrack algorithm, a long 
random word in W, and selected the factor w among the factors which appear 
most often is this word. Then we constructed a family T w of words in W with 
prefix u, suffix v, and with a size of at most a fixed number. (If the search fails, 
we try again with another w, or with a larger size.) We constructed the graph 
with vertex set T w , and edges {x,y} such that xyx and yxy are in W. Finally, 
we checked the sufficient conditions on every morphism which corresponds to a 
clique of size 4 in the graph. 

As an example, for the morphism /13, which was the most arduous to find 
among the morphisms presented here, we use the following parameters. We 
selected the 88 factors with a size of 300 which appear most often among ap¬ 
proximately 1.2 million factors we found in the random word. We computed the 
families J- w with words of size at most 700. All the families T w we computed 
had a size between 1000 and 5 000. The graph of the family that gave us the 
morphism /13 had 58 680 edges and 1977 vertices (density of ^0.03). With the 
right parameters (|u>| and length of the elements of T w ) it took half a day to 
find a good morphism. 

The presented approach cannot be directly used to answer positively to 
QuestionEJ For every binary word w, |wx|io = |wx|oi (where x is the first letter 
of w). So the matrix N has rank at most 3, and one cannot find an invertible 
sub-matrix M of size 4. Thus Question[5]is, in spirit, close to Makela’s questions 
(Question [ 2 ] and Question [3l the modified version of Question [TJ . 
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